Multi-Level Weighted Sequential Pattern Mining Based on Prime Encoding
نویسندگان
چکیده
Encoding can express the hierarchical relationship in the area of mining the multi-level sequential pattern, up to now all the algorithms of which find frequent sequences just according to frequency, but items have different importance in the real applications, therefore the weight constraint involved to the entire mining process is crucial. The MWSP algorithm based on the candidate generation-and-test approach is one of the best algorithms of the weighted sequential pattern mining, however, which will easily generate the situation of candidate combinatorial explosion during the mining process. Therefore, this paper presents the algorithm PMWSM, which adopts prime encoding to decide the parent-child relationship between different levels by one step of division operation, introduces the concept of K-minimum weighted support count to push weight constraint into the multi-level sequential pattern mining, utilizes the principle of prefix projection database to avoid the occurrence of candidate combinatorial explosion, and takes full advantage of the minimum weighted support count to optimize the algorithm. The experimental results show that the algorithm PMWSM is more effective than the algorithm MWSP on mining multi-level weighted sequential patterns from the sequence database.
منابع مشابه
Sequential Pattern Mining by Pattern-Growth: Principles and Extensions
Sequential pattern mining is an important data mining problem with broad applications. However, it is also a challenging problem since the mining may have to generate or examine a combinatorially explosive number of intermediate subsequences. Recent studies have developed two major classes of sequential pattern mining methods: (1) a candidate generation-and-test approach, represented by (i) GSP...
متن کاملPrism: An effective approach for frequent sequence mining via prime-block encoding
Article history: Received 30 September 2007 Received in revised form 4 June 2008 Available online 23 May 2009
متن کاملA Novel Weighted Support Method for Access Pattern Mining
Sequential Pattern Mining is an important data mining technique that finds out all frequent sequential patterns in a sequence database. Applications in wide range of important domains make Sequential Pattern Mining an interesting area of research. Conventional approach for sequential pattern mining treats each and every item in the sequence with equal importance and thus fails to reflect the in...
متن کاملPerformance Analysis of web page recommendation algorithm based on weighted sequential patterns and markov model
Web usage mining techniques helps the users to predict the required Web page recommendations. In recent times, there has been a considerable significance given to sequential mining approaches to construct web page recommendation systems. This paper focuses on developing a web page recommendation approach for accessing related web pages more efficiently and effectively using weighted sequential ...
متن کاملMining rare sequential patterns with ASP
This article presents an approach of meaningful rare sequential pattern mining based on the declarative programming paradigm of Answer Set Programming (ASP). The setting of rare sequential pattern mining is introduced. Our ASP approach provides an easy manner to encode expert constraints on expected patterns to cope with the huge amount of meaningless rare patterns. Encodings are presented and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- JDCTA
دوره 4 شماره
صفحات -
تاریخ انتشار 2010